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MAXIMA OF A RANDOMIZED RIEMANN ZETA FUNCTION, 
AND BRANCHING RANDOM WALKS 

LOUIS-PIERRE ARGUIN, DAVID BELIUS, AND ADAM J. HARPER 


Abstract. A recent conjecture of Fyodorov-Hiary-Keating states that the maximum of 
the absolute value of the Riemann zeta function on a typical bounded interval of the critical 
line is expjloglogT — | log log log T + 0(1)}, for an interval at (large) height T. In this 
paper, we verify the first two terms in the exponential for a model of the zeta function, 
which is essentially a randomized Euler product. The critical element of the proof is the 
identification of an approximate tree structure, present also in the actual zeta function, 
which allows us to relate the maximum to that of a branching random walk. 


1. Introduction 

The Riemann zeta function is defined for Re(s) > 1 by a sum over integers, or equivalently 
by an Euler product over primes, as 

OO . 

(1) = n 

n=l p primes 

and by analytic continuation for other complex s. The behaviour of the function on the 
critical line Re(s) = 1/2 is a major theme in number theory, the most important questions 
of course concerning the zeroes (e.g. the Riemann Hypothesis). 

This paper is motivated by the study of the large values of |C(s)| on the critical line 
s = 1/2 + it. Little is known about the behavior on long intervals, say 0 < t < T for T 
large. The Lindelof hypothesis, which is implied by the Riemann hypothesis, states that 
maxo<t<T |C(l/2 + *^)| grows slower than any small power of T. See the paper of Farmer, 
Gonek and Hughes na for more precise conjectures about this maximum size, and the 
paper of Soundararajan [25] for a rigorous lower bound. More recently, Fyodorov, Hiary and 
Keating considered the maximum on bounded intervals of the critical line. They made the 
following conjecture: 

Conjecture (Fyodorov-Hiary-Keating [TTl 115] ). For r sampled uniformly from [0,T], 

(2) max log |C(l/2 + i(r + h))\ = loglogT - ^ logloglogT + Op(l) , 

/ie[o,i] 4 
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where Op(l) is a term that is stochastically bounded as T ^ oo. 

The main result of this paper is a proof of the validity of the hrst two terms in ([^ for a 
random model of C dehned in ([^ below, which is essentially a randomized Euler product. 
Until now such precise estimates were not known rigorously even for models of zeta. 

The conjecture is intriguing for many reasons. From a number theory point of view, 
the precision of the prediction is striking. From a probability point of view, the leading and 
subleading order of the maximum correspond exactly to those of the maximum of a branching 
random walk (which is a collection of correlated random walks indexed by the leaves of a 
tree), as will be explained below. In fact, the key element of the proof for the random model 
will be the identification of an approximate tree structure for the zeta function. 


1.1. Modelling the zeta function. If we take logarithms and Taylor expand the Euler 
product formula for the zeta function, we find for Re(s) > 1, 

°° 1 _ 1 _ 1 

(3) log c(s) = - iog(i - p-*) = = ^(1)’ 

since the total contribution from all proper prime powers with fc > 2) is uniformly 
bounded. One of the great challenges of analytic number theory is to understand how the 
influence of the Euler product may persist for general s G C. The definition of our random 
model is based on a rigorous result in that direction, assuming the truth of the Riemann 
Hypothesis, proved by Harper [H] by adapting a method of Soundararajan [2B] (which itself 
builds heavily on classical work of Selberg pi]). 


Proposition 1.1 (See Proposition 1 of Harper [IS])- Assume the Riemann Hypothesis. For 
T large enough there exists a set Ft F [T, T + 1], of measure at least 0.99, such that 


( 4 ) 


log|C(l/2 + R)| = Re 


1 log(T/p) \ 
logT j 


+ 0 ( 1 ) 


Vf e n. 


The set FL produced in Proposition |1.1| consists of values t that are not abnormally close, 
in a certain averaged sense, to many zeros of the zeta function. It seems reasonable to think 
that one shouldn’t typically find maxima very close to zeros. Moreover, if one only wants 
an upper bound then the restriction to the set Ft can in fact be removed, at the cost of 
a slightly more complicated right-hand side. Therefore, to understand the typical size of 
maxo</i<i log |C(l/2 -l- *(t -|- h))\ as r varies we should try to understand the typical size of 


maxo<fe<i Z]p<T 


log(T/p) \ 


The factor log(T/p)/ log T is a smoothing introduced 


jpKT ^pl/2 + i{T-l-h) logT 

for technical reasons. For simplicity we shall ignore it in our model. 

Since the values of logp are linearly independent for distinct primes, it is easy to check by 
computing moments that the hnite-dimensional distributions of the process (p”*’’^, p primes), 
where r is sampled uniformly from [0,T], converge as T —)■ cx) to those of a sequence 
of independent random variables distributed uniformly on the unit circle. Following [I9] . 
this observation suggests to build a model from a probability space (hi, H, P) with random 
variables {Up,p primes) which are uniform on the unit circle, and independent. For T > 0 
and h G M, we consider the random variables view of Proposition 
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1.1[ the process 

(5) 


\p<T 


5 ^,.. [ 0 , 1 ] 


seems like a reasonable model for the large values of (log \({l/2 + i{T + h))\, h E [0,1]). 

1.2. Main Result. In this paper, we provide evidence in favor of Conjectureby proving a 
similar statement for the random model ([^. At the same time, we hope to outline a possible 
approach to tackle the conjecture for the Riemann zeta function itself. 

Theorem 1.2. Let {Up,p primes) be independent random variables on (hi, P), distributed 
uniformly on the unit circle. Then 


( 6 ) 


max > 
/ie[o,i] ^ 

^ p<T 


Re{UpP ) ^ iQg 2 ^ _ ^ iQg iQg iQg _|_ op(iog log log T), 


p 


1/2 


where the sum is over the primes less than or equal to T and the error term converges to 0 
in probability when divided by log log log T. 


An outline of the proof of the theorem is given in Section 1.4 below. The technical tools 
needed are developed in Section and dually the proof is given in Section 

1.3. Relations to previous results. The leading order term log log T in ([^ was proved 
in [in], where it was also shown that the second order term must lie between —2 log log logT 
and —(1/4) log log log T. As well as giving a stronger result, our analysis here is ultimately 
based on a control of the joint distribution of only two points hi and h .2 of the random 
process at a time, which could feasibly be achieved for the zeta function itself. In contrast, 
the lower bound analysis in [19] depends on a Gaussian comparison inequality that requires 
control of log T points. 

Fyodorov, Hiary and Keating motivated Conjecture in HI [T5] using a connection to 
random matrices. There is convincing evidence, see e.g. [20], that the values of the zeta 
function in an interval of the critical line are well modelled by the characteristic polynomial 
Pn{x) of a.n N X N matrix sampled uniformly from the unitary group, for x = on 
the unit circle. In this spirit, they compute in [TH] the moments of the partition function 
Zn{P) = |P/v(e*^)|^(i6*. They argue that these coincide with those previously obtained for 


a logarithmically correlated Gaussian held [I3]. For large (3, this leads to the conjecture that 
the maximum of the characteristic polynomial behaves like the maximum of the Gaussian 
model. Unfortunately, the analogue of Gonjecture for this random matrix model is not 
known rigorously even to leading order (see [28] for recent developments at low fd and its 
relation to Gaussian multiplicative chaos). The conjecture is also expected to hold for other 
random matrix models such as the Gaussian Unitary Ensembles, see [16]. One advantage of 
the model (|^ is that it can be analysed rigorously to a high level of precision with current 
probabilistic techniques. 


As explained in Section 1.4 the proof of Theorem 1.2 uses in a crucial way an approximate 
tree structure present in our model and also in the actual zeta function. This structure 
explains the observed agreement between the high values of the zeta function and those 
of log-correlated random helds. The approach to control subleading orders of log-correlated 
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Gaussian fields and branching random walks was first developed by Bramson |9] in his seminal 
work on the maximum of branching Brownian motion. It has since been extended to more 
general branching random walks by several authors, e.g. and to log-correlated 

Gaussian helds, see for example uni 123. This type of argument can also be applied to 
obtain the joint distribution of the near-maxima, see e.g. ME]. Recently, a multiscale 
rehnement of the second moment method was introduced by Kistler in [2T] to control the 
leading and subleading orders of processes with neither a priori Gaussianity nor exact tree 
structure. It was successfully implemented in j5| to obtain the subleading order of cover 
times on the two-dimensional torus. The proof of Theorem follows the same approach. 

It is instructive to consider the conjecture in the light of the statistics of typical values of 
the zeta function. One beautiful result is the Selberg central limit theorem [21], which asserts 
that if r is sampled uniformly from the interval [0, T] then loglogT)”^/^ log |C(l/2 -l- ir)] 
converges in law to a standard Gaussian variable. Thus, to obtain a rough prediction for 
the order of the maximum on [0,1], one may compare it to the maximum of independent 
Gaussian variables of mean 0 and variance ^ log log T. For logT such variables, it is not hard 
to show that the order of the maximum is log log T— ^ log log log T -|-0(1). The leading order 
agrees with Gonjecture[T| but the constant in the subleading correction is different. Our proof 
shows how to modify this “independent” heuristic to account for the “extra” — | log log log T 
present in Gonjecture[^ Bourgade showed a multivariate version of Selberg’s theorem where 
the correlations are logarithmic in the limit [8]. However, the convergence is too weak to 
describe the maximum on an interval. 


1.4. Outline of the proof. The proof of Theorem |1.2| is based on an analogy between the 
process ([^ and a branching random walk (also known as hierarchical random held). We 
make this connection precise here, and indicate for an unfamiliar reader how to analyse the 
maximum of a branching random walk. 

We will work in the case where T = for some large natural number n. In this setup, 
the process of interest in Theorem 1.2 is 


(7) 


{Xn{h),he[o,i]) 


where Xn{h) = 


Re{UpP 


p<e^ 


p 


1/2 


is a continuous function of h. Since loglogT = ?7,Iog2 and logloglogT = logn-l- 0(1), 
Theorem Ol can be restated as: 


( 8 ) 

( 9 ) 


lim P 

n^oo 


mn(—e) < max XXh) < mnis) 
/ie[o,i] 


where mn{s) = n log 2 — - log n + e log n . 


1, for all £ > 0, 


In other words, with large probability, the maximum of the process lies in an arbitrarily 
small window (of order log n) around n log 2 — | log n. 

By symmetry of Up we have E[X„(h)] = 0 for any h G [0,1]. Also a simple compu¬ 
tation shows that E[Re([/pP“*^)Re(f/pP“*^')] = (l/2)cos(|h — h'|logp), so the covariance 
¥\Xn{h)Xn{h')\ equals | X]iogp< 2 " cos(|h — h'\ logp). Using well known results on primes 
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(cf. Lemma 2.1), it is possible to estimate this as 


( 10 ) 




for any h, h' G [0,1] provided \h — h'\ > 2“"'. If instead \h — h'\ < 2“"', then the covariance 
is almost n(log2)/2, i.e. X^ih) and X^ih') are almost perfectly correlated. Therefore, one 
can think of the maximum over h G [0,1] as a maximum over 2” equally spaced points. 

The key point of the proof is that the logarithmic nature of the correlations can be under¬ 
stood in a more structural way using a multiscale decomposition. Precisely, we rewrite the 
process as 


( 11 ) 

is the 
large. 


Xn{h) = '^Yk{h), where Yk{h) = ^ 


Re{UpP 


—ih\ 


k=0 


2^~i<logp<2^ 


p 


1/2 


increment at “scale” k of Xn{h). It is not hard to show, see Section 2.1, that for k 


( 12 ) 


E [Yu{hf] 


log 2 
2 


and E[n(h)n(h')] 



if \h-h'\ < 2-^ 
if \h-h'\ > 2-^ 


In view of (12), for given h, h', one can think of the partial sums Xk{h) = '^j=iYj{h) 


and Xfc(h') = 'Ylij=iYj{h') as random walks, where the increments Yj{h),Yj{h') are almost 
perfectly correlated (so roughly the same) for those j such that 2^ < \h — and where 

they are almost perfectly decorrelated (so essentially independent) when 2^ > \h — 

A similar, but exact, behaviour would be obtained as follows: Consider 2” equally spaced 
points in [0,1], thought of as leaves of a binary tree of depth n. Place on each edge of the 
binary tree an independent Gaussian with mean zero and variance (log2)/2, and associate 
to a leaf the random walk given by the partial sums of the Gaussians on the path from root 
to leaf, see Figure [T] With this construction, the hrst k increments of the random walks of 
two leaves will be exactly the same, where k is the level of the most recent common ancestor, 
and the rest of the increments will be perfectly independent. This tree construction is an 
example of branching random walk. For the model ([^ of zeta, the branching point k where 
the paths Xk{h) and X^iji') roughly decorrelate is 


(13) 


h A h' = [log2 \h — h'\ 


So h and h' correspond to leaves whose most recent common ancestor is in level k = h Ah'. 
We note that the different nature of the correlations for different ranges of p was already 
exploited in early work of Halasz [TH] , although without drawing any connection to branching. 

A compelling method to analyse the maximum of a branching random walk and of log- 
correlated processes in general is a multiscale refinement of the seeond moment method as 
proposed in 1211 , which we implement to the approximate branching setting described above. 
Naively, one could hrst consider the number of variables whose value exceeds a given value 
m, i.e. the number of exceedances, 


(14) 


Z{m) = #{j < 2- : X„(j72-) > m} . 
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Figure 1. (Left) An illustration of the correlation structure of a branching 
random walk. (Right) A realization of two paths of an approximate branching 
random walk with increments almost equal before the branching point h A h' 
and almost decoupled after. The barrier below which the paths must stay is 
also shown. 


Clearly, maxj< 2 « X„(j/2"^) > m if and only if Z{m) > 1. 
maximum can be obtained by the union bound 

(15) 


Thus an upper bound for the 


P(Z(m) > 1) < E[Z{m)] = 2’"P(X„(0) > m) 


On the other hand, a lower bound can be obtained by the Paley-Zygmund inequality, 

E[Z(m)]2 


(16) 


¥{Z{m) > 1) > 


E[Z(m)2] ■ 

More precisely, one would choose m = m{n) large enough in ( |I5) ) so that E[Z{m)] = o(l), and 
m small enough in (16) so that E[Z]2 = (1 + o{l)yE[Z\ and thus E{Z{m) > 1) = 1 + o(l). 
For this one needs large deviation estimates: if we think of X„(h) as Gaussian with variance 
n log 2/2, then a standard Gaussian estimate yields that P(X„(h) > m) is approximately 
:^g-m 2 /{(iog 2 )n)_ Thus 2"'P(X„(0) > m) = o(l) when m = (log2)?7, — |logn + elogn. 
This would in fact be the correct answer (the union bound would be sharp) if the random 
variables X„(j/2”) were independent. However, if m = (log2)n — |logn + elogn, then 
2 mp(x^( 0 ) > > cn^~^ —)• oo, so (|T^ cannot prove the upper bound we seek in Theorem 


0 Similarly, the right-hand side of (|16[) will tend to zero unless m < since strong 

correlation between exceedance events for nearby h, h' inflates the second moment. Thus the 
lower bound obtained is not close to what we seek even to leading order. 

To get tight bounds, one needs to modify the definition of the number of exceedances 
using an insight from the underlying approximate tree structure. For branching random 
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walk there are exactly 2^ distinct partial sums up to the fc-level, one for each vertex at that 
level. By analogy one expects that the “variation” in Xk{h) (i.e. in the partial sums up to 
the fc-th level) for different h G [0,1] should be captured by just 2^ equally spaced points 
in [0,1]. Even if they were independent, it would be very unlikely that one of these 2^ 
values exceeded k log 2 + 5, for i? > 0 growing slowly with n, and it turns out that positive 
correlations only make it less likely. This can be proved using elementary arguments, cf. 
Lemma 3.4 In other words, with high probability, all random walks Xk{h) must lie below 
the barrier k ha /clog2 + B. This suggests to look at the modihed number of exceedances 


(17) 


Z(m) = #0 < 2” : A'„(j72”) > m,XtU/2") < *:log2 + B ,Vk < n} . 


It turns out that replacing Z by Z in the hrst moment bound (15) and (with slight modih- 
cations) in the second moment bound (16) will yield the correct answer. To see this in the 


former case, we write the hrst moment by conditioning on the end point: 

(18) W[Z{rn)] = 2"'P(X„(0) > m) x P(Xa;( 0) < /clog2 + B ,V/c < n|X„(0) > m) . 

By the earlier naive discussion, the hrst two terms amount to when we set m = 

n log 2 — I log n + e log n. The third term is the probability that a random walk bridge 
starting at 0 and ending at m = n log 2 — | log n + e log n avoids the barrier k log 2 + B. This 


probability turns out to be n as shown by the ballot theorem, cf. Lemma 2.12 Therefore, 


E[Z(m)] = 0{n ^) —)■ 0, for all £ > 0. A similar analysis can be done for the lower bound, 
where we have the obvious inequality F{Z{m) > 1) > F{Z{m) > 1). The extra barrier 
condition turns out to reduce correlations between exceedance events sufficiently so that the 
second moment is now essentially the hrst moment squared when m <n log 2—| log n—e log n 
(this indicates why the second moment of Z(m) is too large: in the exponentially unlikely 
event that a path manages to go far above the barrier, it has exponentially many “ohspring” 
that end up far above the typical level of the maximum). 

The form of the subleading correction is thus explained by the extra “cost” n~^ of satisfying 
the barrier condition. And the barrier condition arises because of “tree-like” correlations 
present in the values of (the model of) the zeta function. This suggests the possibility that 
the partial sums of the Euler product ([^ of the actual zeta function behave similarly where 
the zeta function is large. 

To prove Theorem El we must address the imprecisions in the above discussion. The 
necessary large deviation estimates are derived in Sect ion 2.1 The claim that Xk{h) does 
not vary much below scale 2“^ is proved in Section 2.2 using a chaining argument. Another 


issue is that our process is not an exact branching random walk because increments are never 
perfectly independent (for different h, h') nor exactly identical. To deal with this, we use a 


Berry-Esseen approximation in Section 2.3 to show that the random walks are very close to 
being Gaussian. This allows for an explicit comparison with Gaussian random walks with 
i.i.d. increments and “perfect” branching. Moreover, to get a sharp lower bound with the 
second moment method, it is necessary to “cut off the hrst r scales” and consider 


(19) Xr,k{h) = Xk{h) - Xr{h) for heR, 

for an appropriately chosen r. Finally, it should be stressed that our approach relies only on 
controlling hrst and second moments, which means that the estimates we need only involve 
at most two random walks simultaneously. 
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2. Preliminaries 


Throughout the paper, we will write c for absolute constants whose value may change at 
different occurrences. A sum over the variable p always denotes a sum over primes. 

2.1. Large Deviation Estimates. In this section, we derive the large deviation properties 
of the increments (Yfc(h),h G [0,1]) and their sum. We hrst derive basic facts on their 
distribution and in particular on their correlations. 

Recall that the random variables {Up,p primes) are i.i.d. and uniform on the unit circle. 
For simplicity, we denote the p-th term of the sum over primes in ([^ by. 


( 20 ) 




p 


Note that the law of the process {Wp{h), h G M) is translation-invariant on the real line and 
also invariant under the reflection h ^ —h. A straightforward computation using the law of 
the Up's and translation invariance gives 


( 21 ) 


E [Wp{h)Wp{h')] = — cos(|h — h'\ logp), for all h, U. 

Zp 


In this notation, the increments dehned in ( pT| are 

( 22 ) Yk{h)= Wp{h),he 


2^ ^<logp<2^ 


Using (21) and the independence of the Up’s, the variance of Yk{h) becomes 

1 

^ ’ 


(23) 


crl = Var(yi(ft)) = 

2^“i <logp<2^ 

and the covariance of Yk{h) and Yk(h') is. 


(24) 


pt(h,h')^'B.[\l(h)Yt{h')]= Y 


2^~i<logp<2^ 


— cos(|h — h'\ logp). 
Zp 


The next lemma formalizes (12), giving bounds for how close the variance of the increments 
is to 

(25) cr2 = (log2)/2, 

and for h ^ h', how close the covariance is to the variance before the ’’branching point” 
h A h\ defined in (13), and how fast it decays after. 

Lemma 2.1. For h, h' G M and k > 1, 

(26) al=E [Yk{hY] =a^ + o(e 


—cV^ 


(27) puiKh') = E[Yu{h)Yu{h')] 



Note that in both cases the error term decays exponentially in k. 


if k < h A h', 
if k > h Ah'. 
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Proof. We use a strong form of the Prime Number Theorem (see Theorem 6.9 of 
states that 

(28) . 


which 


r 

< X : p prime} = 


By replacing the sum Y.p<p<Q p 
parts, one obtains 


logn 

- with the integral ^ 


du + 0{xe 


—c-^/log a:'\ 


P u log u 


du using (28) and integration by 


- = log log Q — log log P + 0{e fQj, all 2 < P < Q. 

P<P<Q ^ 


This together with (23) yields (26). Similarly (28) implies that 


Pk{Kh') = - 


cos(|h — h'\ logu) 
u logu 


du + O {{1 + \h — h'\)e 


—cy/2^~^ 


When 2^\h — h'\ = < 1, the claim ( [^ follows by using that cos(|h — h'\ logu) = 

1 + 0{\h — h'\‘^{\oguY). When 2“^|h — h'\~^ = < 1, we use integration by parts. 

After the change of variable v = logu, the integral becomes 


sin(|h — h'\v) 


+ 


sin(|h —h'|n) 

\h — h'\v 2 *-! ' J 2 k-i \h — h'\v‘^ 

Both terms are 0(2“^|h — 


du . 


□ 


Remark 1. A similar but easier argument using (28) shows that 


(29) 


l^SiiP^OHlogQD, foraUl<P<Q. 

P<p<Q ^ 


The main results of this section are explicit expressions for the cumulant generating func¬ 
tions of the increments, from which we will deduce large deviation estimates. For fixed 
h, h' G M, we will often drop the dependence on h and h' when it is clear from context and 
dehne 

Yu={Yu{h),Yk{h')) . 

The covariance matrix of is then denoted by 

E, = Cov(yo 

The eigenvalues of Sa, are cx^ ± pa:- 

The cumulant generating functions are 

(30) ^A) = logE[exp(Alfc)] = logE[exp(A -Yk)] , 


where A G M, A G and “ • ” is the inner product in M^. The following change of measure 
will also be needed in the proof of Theorem |1.2 


(31) 


^Qa 


dF 


n 




for A G M, 


dQx 

dP 


n 




for A G 
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Recall that in the univariate case, 

(32) 

and in the multivariate case, 

(33) QaAJ = VVf (A), 




(P 


VarQ>(n) = . 


CovQ^(yt) = Hess i/'f’lA) . 


The results also provide bounds on these quantities. We hrst state the result for the uni¬ 
variate case. The proof is omitted since it is a special case of the multivariate bound in 


Proposition 2.4 


Proposition 2.2. Let C > 0. For all 0 < X < C and k large enough (depending on C), the 
cumulant generating function (A) satisfies 


(34) 


(^) = 




+ 0 e 


Moreover, for such k, the measure Qa in (31) satisfies 

(35) Qa [Yk] = Xal + 0 (e~^'^ ' j , Varq^ [W] = al + 0 (^e~‘^’° ' j . 

One useful consequence of the proposition is a one-point large deviation estimate, which 
after being strengthened to a bound for the maximum over a small interval, will be a crucial 
input to the proof of the upper bound of Theorem 1.2 (see (44) and (74)). Recall from (19) 
that Xr,k{h) = Xk{h) - Xfih) = Y.tr+1 yii.h). 

Corollary 2.3. Let C > 0. For any t] < r < k — 1, Q < x < C{k — r) and h G M, 


(36) 


P [Xr^k (h) > x] < cexp 


X 


2{k — r)a^ 


where the constant c depends on C. 

Proof. Using the exponential Chebyshev’s inequality, the probabili ty i n 
above by exp =r+i W(-^) ~ > 0. By Proposition 

place of C), we get that if A < IOC, 

,2 ^ 


2.2 


is bounded 
(with, say, IOC in 


P [Xr^k {h) > x] < exp I c + — erf — Ax -|- 0{t 


-c2^ 


l=r+l 


< cexp y 2 ^ CT; - Ax c cexp I — (fc - r) a - Ax 1 , 


Z=r-|-1 


where for I too small for (34) to be applied, we simply use that t/’z(A) is bounded. Setting 
A = x[{k — r)(j^) < IOC gives the result. □ 


We now prove the bounds in the multivariate case. 
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Proposition 2.4. Let C > 0. For all X = (A, A'), where 0 < A, A' < C, and k large enough 
(depending on C), the cumulant generating function (A) satisfies 


(37) 

Moreover, for such k, the measure 

(38) Qa [Yk] = SfcA + O (e 
Proof. We first compute 




m 

_2fe-i 


satisfies 

) and CovQ, [Yk] = + 0{e-^"~'). 


(39) 


= logE[exp(AWp(0) + A'Wp(|/i-/i'| 


= log ^ {fm oos(^) + ^ cos(0 + \h- h'\ logp)) dO. 

Recall that for any a, 6 G M, 


r- 27 r 


(40) 


— / exp(a 008 ( 0 ) + bsm(9))d9 = + 6^), 

271 ./n 


where denotes the n-th modihed Bessel function of the hrst kind. The identity cos(0+?7) = 
cos(0) cos{'r]) —sin(0) sin(r 7 ) can be used together with (40) to write the integral in the bottom 
line of (39) as 

Jo (A + cos(|h - h'\\ogp)X'f + ^ (sin(|h - h'\\ogp)X'f^ 

= h [^Jl (A2 + 2AA'cos(|h - h'\\ogp) + A'2)) = (^2A ■ M^X) , 

where 

1 cos(|/i-h'llogp) 

^ 2p \ cos(|h — h'l logp) 1 


is the covariance matrix of {Wp{h), Wp{h')), see ( pl| ). Thus writing 
(42) /(x) = log/o(v^), 

we have (A) = / (A ■ MpX). Recall that Jq (a:) has Taylor expansion Io{x) = 1 + ^ + 
+ 0(x®) (which can be verihed by expanding in ([40|)), so that / has Taylor expansion 


(43) 


m = ^-^ + 0{x^). 


Now since the random variables Up are independent, 

V'f(N= E E /D-AfpA), 

2^“i<logp<2^ 2^“i<logp<2^ 

The bound ( |4^ implies that for k large enough (depending on C), 

V'f(A)= (iA.MpA + 0(p-")) =iA.StA + 0 

2^“i <logp<2^ 


_ 2 fc 


This proves ( |37[ ). 

The hrst claim of (38) follows similarly after noting that the gradient of the map A —)■ 
/(A ■ MpX) is MpXf (A ■ MpX), and using the bound f'{x) = ^ + 0{x), valid for x G [0,1]. 
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Finally the second claim of (38) follows by noting that the Hessian of the aforementioned 
map is 

Mpf{\ ■ Mp) + {MpX){MpXff (A ■ MpX), 

and using the previous bound for /'(x), and that f'\x) is bounded in [0,1]. □ 

2.2. Continuity estimates. The main result of this section is a maximal inequality which 
shows that the maximum over an interval of length 2“^ of the held Xr^k{h) is close to the 
value of the held at the mid-point of the interval, where Xr^k{h) is dehned in (19). One of 
the upshots is to reduce the proof of the upper bound of the maximum of the process on 


[0,1] to an upper bound on the maximum over a discrete set of points in Section 3.1 

Proposition 2.5. Let C > 0. For any 0 < r < k 
and h G M, 


1, 0 < a: < C (A; — r), 2 < a < 2 


2k 


X 


(44) P 


max Xr kih') > X + a, Xr k (h) < X 
h':\h'-h\<2->=-^ 


< cexp 


X 


2{k — r) cr^ 


- ca3/2 


where the constants c depend on C. 

The proof of the proposition is postponed until the end of the section. It is based on a 
chaining argument and an estimate on joint large devi ations of Xr^k{h) and of the diherence 
Xr,k{h') — Xr^k{h) for \h' — h\ < see Lemma 2.7 below. The exponent of the a term is 


probably not optimal. A direct consequence of the proposition is the following large deviation 
bound of the maximum of Xk {h) over an interval of length 2~^. 

Corollary 2.6. Let C > 0. For any < r < k — 1, hGM and <x <C {k — r), 


(45) 


P 


max Xr^kih') > x 

h':\h'-h\<2-^-^ 


< cexp 


X 


2{k — r) cr^ 


where the constant c depends on C. 


Proof. The left-hand side of (45) is at most 


P 


max Xr^k (h') > (x — 2) + 2, Xr^k (h) < x — 2 


-|- P [Xr^k (h) > X — 2] . 


The bound follows by (44) with a; — 2 in place of x and a = 2, and the bound (36) 

intervals of length 2 


□ 


Remark 2. A union bound over 2"' 
(46) P 


max Xn (h) > (1 -|- 5)nlog2 
he[o,i] 


< c2 


yields 

for all 6 >0, 


where (45) is used with r = 0 and k = n (note that Xn{h) = Yo{h) + Xo^n{h) and Yq is 
bounded). This proves that max/igp^j (h) is at most (1 -|- o(l))nlog2, which is tight to 
leading order, but does not include the subleading correction present in (|^ and §, 


To prove Proposition 2.5 we will use the following large deviation estimate for Xr,k (0) and 
the difference Xr^k (^ 2 ) ~ ^r,k {hi) (jointly), where \h 2 — hi\ < 2~^. It shows that on a large 
deviation scale the two quantities are essentially independent, and that the difference decays 
rapidly with 1^2 — hi |. The latter is a consequence of the covariance of the held ^ (h) losing 
its log-correlation structure below scale 2“^, and instead decaying linearly with distance. 
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Lemma 2.7. Let C > 0. For any t)<r<k — \,t)<x<C{k — r),t)<y< 2^^ and any 
distinct < hi, h 2 < 


cy 


3/2 


2{k — r) cr^ 2 ^\h 2 — hi\ 


(47) P [Xr^k (0) > X, Xr,k {h 2 ) - Xr,k {hi) > y] < cexp 
where the constants c depend on C. 

Proof. Observe first that we may assume y is bigger than a large constant depending on 
C times 2^|/i2 — hi\, (and therefore also bigger than a large constant times 2^^|h2 — hip), 


because otherwise the required bound follows from (36). 


For any Ai, A 2 > 0, the left-hand side of (47) is bounded above by 

(48) E [exp (AiX^^fc (0) A 2 {Xr^k (^ 2 ) - Xr,k (hi)))] exp (-Aix - A 2 I/) • 
We will show that if Ai < IOC* and 1 < A 2 < |h 2 — hi|“^, 

E [exp {XiXr^k (0) -|- A 2 {Xr^k {^ 2 ) — Xr^k (hi)))] 

(49) ^ 


< cexp I -r) + cA 22 ''|h 2 - hi\ + c(A 22 ''|h 2 - hi|)' 

The result then follows by choosing Ai = x {{k — r)a‘^)~^ and A 2 = cy^^'^ 2~'^\h2 — hi\~^ 


in (48) and (49), for a suitable small c, and using our assumption that y is bigger than a 


large constant times 2^|h2 — hd. Note that the assumptions on x,y,hi and h 2 ensure that 
Ai < IOC and 1 < A 2 < |h 2 - hil’h 


We now prove (49). First we note that similarly to the argument from (39) to (41), 

(50) E [exp {XiWp (0) + A 2 {W^ (h 2 ) - (hi)))], 

can be written explicitly as 

( 51 ) _^ 

lo f yi(Ai + (cos(/i2logp) - cos(/ii logp))A2) + t((sm(/i2logp) - sin(/ii logp))A2) 


Recall from (43) that log/o(v^) = S'lid that cos(h 2 logp) — cos(hilogp) = 

0(|h2 — hi| logp) and sin(h 2 logp) — sin(hi logp) = 0(|h2 — hi| logp). Thus provided Ai < 
IOC, 1 < A 2 < |h 2 — hi|“^ and p is large enough, the logarithm of the quantity in (50) is at 
most 


(52) 


1 c 

— (Ai cA 2 |h 2 - hi| logp)^ -(A 2 |h 2 - hi| logp)^ cp~^ 

4p p 

< ^ + -A 2 |h 2 - hi| logp+ - (A 2 |h 2 - hi| logp)^ cp~‘^. 

Ap p p 


Here we used the fact that Ai < IOC. After summing over 2^ < logp < 2^ we get that 
logE [exp (AiX^^fc (0) + A 2 {Xr,k (h 2 ) - Xr^k (hi)))] 

logPx 1; , I , (logp)^ 


<c+ 


a; 

~A - ^ ^ 

4p ' p 

2'^<\ogp<2^ 2^<logp<2^ 


E 


-A 2 |h 2 — hi| -|- c ^ ^ 


2’'<logp<2* 


p 


(A2|h2-hi|)^ 


In the above, if p is too small for (52) to be an upper bound, we simply use that (50) is 
bounded. The claim (49) now follows from the bounds (26) and (29). □ 
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We are now ready to prove Proposition 2.5 We will use the following notation: for k gN, 


let 


(53) 


T-Lk be the set of dyadic rationals, so that "Hq C "Hi C ... C Tik C ... C 
is a nested sequence of sets of equally spaced points and {Hk H [0,1)| = 2*^ 


Proof of Proposition \2. 5} Without loss of generality, we may assume h = 0. We can also 
round x up and decrease a so that we may assume that x is an integer and a > 1. Dehne 
the events 

Bq = {Xr,k (0) e [x - q - l,x - q]} ,q = 0,1,. .. ,x -1, and Sj. = {Xr^k (0) < 0} . 

Note that the left-hand side of (liil) is at most 


(54) 


q=0 


BqH < max {Xr,k {h') - Xr,k (0)} >a + q 
' h'eA 


where A = [—2“^“^, 2“^“^]. Let {hi,i > 0) be a dyadic sequence such that ho = 0, h, G 
PLk+i n A and limj_^oo h* = h', so that |hi+i — hi\ E {0, 2“^“*“^} for all i. Because the map 
h I—)■ Xr^k{h) is almost surely continuous, 

OO 

Xr,k{h') — Xr^A:(0) = — Xr^kihij) . 

j=0 

The right-hand side converges almost surely, since X]!=o (^r,A:(hi+i)—W,fc(hi)) = Xy.^k{hi+i) — 
-^r,fc(0) —)■ Xr^k{h')—Xr^kifl)I because X^^kih) is continuous almost surely. Since 2 {i+iy — 
1, we have the inclusion of events. 


{Xr^k {h') — Xr,k (0) > a -|- g} C < Xr^k (hj+i) — Xr^k {hi) 


i=0 


> 


a + q 

2{i + iy 

This implies that {max/^/g^ {Xr^k {h') - Xr,k (0)) >a + q} is included in 

a + q 


U U 

* ^ hi E PLk+i hi A, 

h2 = hi± 2 -*^-*-! 


Xr,k (^2) X^^k (hi) ^ 


2{i + l) 


2 f ’ 


a+q 


where we have ignored the case hi = h 2 since then event < X^^k (h 2 ) — (hi) > .. .2 
the empty set. Because \Pik+i H Al < c2*, the g-th summand in (54) is at most, 

OO r ^ N - 


IS 


sup P 

i=o hi E PLk+i n A, 
h2 = hi ± 2-*^-*-! 


BqP < Xr^k (h2) — Xr^k (hi) > 


2{i + l)^ 


Note that a-|-g<a-|-x< 2^^ by assumption. The inequality (47) can thus be applied to 
get that ( |2.2[ ) is at most 

n2 , „a3/2' 


OO 

c ^ 2* exp 

i=0 


{x — q — 1)^ 
2{k — r)a‘^ 


cT 


{a + qy 

(^ + 1) 


3 / — 
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Since e < g most 

< ce~ 


ce 


-co?!"^ -(3;-g-l)^/(2(fc-r)(T2)-cg3/2 ^ l(^2{k-r)(7^)-ca^^'^ pC(g+l)- 


q=0 


E- 

q=0 


-cq 


3/2 


< ce 




where we used the assumption x < C{k — r). This proves (44). 


□ 


2.3. Gaussian approximation. The purpose of this section is to compare the increments 
Yk{h) to Gaussian random variables with mean and variance independent of k, both for 
a single h G M and for vectors (Tfc(hi), ^^(^ 2 )) for hi 7 ^ /i 2 G M. This will be used in 
the subsequent sections to apply the ballot theorem and derive bounds on the probability 
that Xr^k{hi) and Xr^k{h 2 ) satisfy a barrier condition. One reason to pass to Gaussian 
random variables is that the standard ballot theorem provides such bounds for random 
walks with i.i.d. increments. It does not immediately apply to the process k 1 —)■ Xr^k{h), 
whose increments Yk{h) have slightly different distributions for different k. Moreover, we 
need to show that the increments Yk{hi) and Yk{h 2 ) for two points hi fy h 2 become roughly 
independent when k is beyond the branching point hi A h 2 , cf. (13). To quantify this, we 


introduce a parameter A and refer to the scale hi A h 2 + A as the decoupling point. Passing 
to Gaussian variables facilitates the proof of the decoupling, since in the Gaussian case we 
can investigate independence solely by controlling the covariance and the mean. 

Our main tool is the following multivariate Berry-Esseen approximation for independent 
random vectors. For the remainder of the paper, will denote the Gaussian measure with 
mean vector pu and covariance matrix S. 

Lemma 2.8 (Gorollary 17.2 in [6], see also Theorem 1.3 in [IT]). Let {Wj,j > 1) be a 
sequence of independent random vectors on (M'^, P) with mean E \Wj] and covariance 

matrix CovfWj). Define 

m m 

pim = '^E[Wj] and ^ Cov(W. 

i=i i=i 

Let \m be the smallest eigenvalue ofT^rn CLi^d Qm be the law ofW i + ■ ■ ■ + Wm- 
There exists an absolute constant c depending only on the dimension d such that 


sup 

AeA 


< c\^l^ Y. W) - E\W, 
i=i 


where A is the collection of Borel measurable convex subsets o/M'^ 


Before stating the results, we recall the notation from Section 2.1] is the product 
measure from (31) and for fixed hi,h 2 G M, we write = (41 (hi), 17 (h 2 )), A^r,n = 


{Xr^n (hi) ,Xr^n (^ 2 ))- We show that beyond the decoupling point hi Ah 2 + A, the increments 
under are close (in terms of A) to being independent Gaussians with mean Acr^ and 
variance fy = (log 2 )/ 2 . 
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Proposition 2.9. Let A G M and A > 0 . Let hi, /i2 G M, m > A /12 + A and /x = Acr^. 

For any eonvex subsets C k = m + 1,... ,n, we have 

(55) 

[Xm,k e Ak'im <k <n\ 


= (1 + 0(e-^^)) jt/ G M2x(n-m) ^ Vfc = 1,...,n - + 0(e-^^") , 

where denotes the product measure (on 2{n — m) independent Gaussians each with 

mean y and variance 

Proof. Recall that Yk = Z) 2 '=-l^p< 2 '= where Wp = {Wp{hi) ,Wp{h 2 )). The proof 
has two steps. First, Lemma |2 8| is applied successively for each k from k = n down 
to /c = m + 1 to pass to a Gaussian measure. Then we explicitly compare the resulting 
Gaussian measure ®k=m+infi^ Sj. (fhe product of (n — m) bivariate Gaussians with means 

fik = yk (1,1) = (Qa [Yk{hi)] ,Qa [Yk{h 2 )]) and covariance matrices Sfc = Govq^ [^fc]), to 
the decoupled measure 


Gonditioning on the values of Yj for all m + 1 < j < n — 1, then applying Lemma 2.8 to 
the Wp with 2"'“^ < logp < 2"', and finally integrating over Yj we obtain 


Xm,k e Ak 'im < k < n] 


n—1 


( 56 ) - ^A X ^ Yj e Ak^m < k <n-2, ^ G n (A„ - yn) 

j'=m+l j=m-\-l 

< E Id • 

<logp<2"' 


where yn is sampled from rj^ A„ is the smallest eigenvalue of and An — yn is the 
set An translated by yn- Since an intersection of convex sets is convex, the lemma can be 
applied in the same way to the VPp’s contributing to Yn-i, Yn- 2 , and so on. The resulting 
estimate is then 
(57) 

£ At\lm < k < n]- | ll 2 x(n-m) . ^ y, £ A,, Vk ^ m + 1,... ,n 




^ " E E Ol^P - Qa [^p] f ] • 

k=m+l 2''-l<logp<2'' 

For k > hi A h 2 + A, the eigenvalues A^ are uniformly bounded away from 0. Indeed, observe 


that by (38), and the discussion preceding (30), and Lemma 2.1 


\k = al- pk + 0 {e ^ j = + 0(e + e > c> 0, 













MAXIMA OF A RANDOMIZED RIEMANN ZETA FUNCTION 


17 


for A large enough but fixed. Also by construction, the norm of the vector Wp is bounded 
by Hence the error term in (Is^ is bounded by 


(58) 


< ce-‘ 


c p s ce 

2m<iogp<2" 


< e 


It remains to compare the measure ®k=ra+iVfi^.,tk with the measure The specihcs 

of the considered event play no role at this point, so we write B for a generic measurable 
subset of We show 

(59) = (1 + 0(e-<‘-'“‘"'=l)) |B] + 0(e-''“-‘‘“>>) , Vfc > m. 

Together with 


and (57), this implies the proposition since the estimate (59) can be 


applied successively integrating in each coordinate to get for any A C 

n n 


k=m+l 


k=m-\-l 


= (1 + 0(e-'^))r,; + 0(e-“) , 


To prove (59), we compare densities. Proposition 2.4 and Lemma 2.1 give 

(60) pfc = p + Sfc = + , 

where 1 is the 2x2 identity matrix, using that fc > m > hi A ^2 + A. Consider the set, 

Ek = {y : \\y - AJI < . 

A straightforward Gaussian estimate yields 




2^{k—hiAh2)l2 




< e 


_gc(/c —h,pAh.2) 


and similarly for p^^ 2 [E^]. Therefore, it suffices to prove (59) for B C E^. The density of 
hAj,,Sfc with respect to Lebesgue measure is, 

(61) 


3-(y-Afc)-Sj, h?/-Afc)/2 


By (60), 


27r(detSfc)h^ 

(det Sfc)-i/2 = (x- 2 (i + 


Furthermore for all ^ G M^, 

{y - i^k) ■ ^k\y - Afc) = - i^kf + iy- K) ■ {^k^ - w^i)(2/ - /i^). 


By (60) and the dehnition of Ek, the error term is 

(h - M ■ (A' - <^-T)(h - . 

Thus, on Ek, the density (61) equals (l + 0(e“dt-tiA/i2)j^_^g-||i/-A*ll^/2 particular, 

%kik\B] = (1 + 0(e-'l‘-'"*'‘>>)) for any B c Ek. 

It remains to compare the densities of and We have that 

{y - hf = {y- yf + (yk - yf - 2(y - y){yk - y) • 
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The second term is 0(2 ] 3 y (QQ). The third term can be estimated using the fact 

that \y — /ifcl = 

\{y - y){h - y)\<{\y- M + W - y\)\yk - y\ = _ 

This implies that on B <Z 

VSJB] = (l + 0(e-'=l‘-‘-*'‘=>)) . 


This concludes the proof of the claim (59). 


□ 


The next proposition provides a Gaussian comparison before the branching point. The 
proof is omitted, as it follows the previous one closely, with fi replaced by 2Xa^ in (60). 

Proposition 2.10. Let A G M and A > 0. Let hi, /i 2 G M, m < hi A h 2 — A and fi = 2A(T^. 
For any convex subsets C M^, k = m + 1,... ,n, we have 
(62) 

e Afc Vm < fc < Tz] 

k 


= {l + 0[e-^^)) <: 


X 2{n—m) 
2 


y e 


)X2(n—m) . 


yj G Ak+m v/c = 1,..., n - m| + 0(e 
i=i 


A one-point Gaussian approximation for the measure Qa from (31) will also be needed. 
The proof is again similar to the proof of Proposition |2.9| and is omitted. One noticeable 
difference is in (60) where the covariance estimate is replaced by al = ) because of 

(26). The additive error 6“®'" is then replaced by . The multiplicative error 1-1-0 
becomes 1 -)- 0(e“®''"*), and can thus be “absorbed” in the additive error. 


Proposition 2.11. Let A G M, h G S 

Ak F'R,k = m + 1,... ,n, we have 

Qa [Xm,kih) e Ak Wm < k < n] 
(63) _x(n—m) 


= 


y e 


) X (n—m) 


0 < m < n and y = Aa^. For any convex subsets 


: ^ 2 /i G Ak+m Wk = 1,... ,n - + 0{e . 

i=i 


2.4. Ballot theorem. The ballot theorem provides an estimate for the probability that a 
random walk stays below a certain value and ends up in an interval. We state the case we 
need, which is that of Gaussian random walk with increments of mean 0 and variance cx^. 

Lemma 2.12. Let be a Gaussian random walk with increments of mean 0 and 

variance > 0, with Xq = 0. Let h > 0. There is a constant c = c(a,S) such that for all 
a>0,b<a — 6 and n > 1 


(64) P [Xn G {h,h + h) and Xk < a for 0 < k < n] < c 

Also provided h < 1, 


(1 -|- n)(l “1“ o — 5) 
)^372 ■ 


(65) 


cn 


3/2 


< P [Xn G (0, 5) and Xk < 1 for 0 < k < n]. 
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Proof. Note that {Xk)o<k<n has the law of {<jBk)o<k<n, where {Bt)t>o is standard Brownian 
motion. Thus we see that the probability in (64) conditioned on = y can be written as 


the probability that a Brownian bridge avoids a barrier at integer times. The bound (6.4) of 
[27] shows, after shifting by a/cr and reflecting, that this condidional probability is at most 
c(l + a/cT)(l + {a — b — 5)/a)/n. Noting that P[Xn G (&, & + 5)] < cn~^B then yields (64). 
In a similar fashion the display below (6.4) in [27| gives (65). □ 


3. Proof of Theorem 11.21 
In this section, we prove (1^, that is. 


( 66 ) 


lim P 

n^oo 


fnn{—s) < max Xn(h) < rrinis) 
he[o,i] 


= 1, for all e > 0. 


This proves Theorem 1.2 for the subsequence T = e^", n G N. The extension of the argument 
to general sequences T follows by trivial adjustments. We will need to consider the process 


Xr,nih) with the hrst r scales cutoff, see (19). Throughout this section we use 
(67) r = [(loglogn)^J. 

First we show that the difference between max/igjo^] Xr,n (h) and max/igjo^] (h) is negligible 
compared to the subleading correction term. 


Lemma 3.1. For all c > 0, 

( 68 ) 

(69) 


lim P 

n^oo 


max Xn (h) > rrin (2e), max Xm (h) < nin-ri^) 
/ie[o,i] /ie[o,i] 


lim P 

n—^oo 


max Xn (h) < rrin (—2e), max Xm {h) > ’R-n-r (~^) 
/ie[o,i] /ie[o,i] 


= 0 , 

= 0 . 


Proof. The event in the probability in (68) implies max/jgjo^] Xj. (h) > (log 2) r+e log (n — r) > 
100 (log2) r, where the last inequality holds for n large enough. But (46), with n = r, gives 


P 


max Xr (h) > 100(log2)r 
/xe[o,i] 


< 2"^^^ ^0, asr^oo. 


Since the laws of max/igjo^] (h) and — min/igjo^] (h), coincide we also have that the 
probability P [min/^gjo^j X^ (h) < —100(log2)r] tends to 0 as r —)■ cx), which similarly implies 

d^. ’ □ 


In the proof of (66) we will use a change of measure under which the process Xr^n has an 
upward drift of 

rrin-ri^) log 2 - f log(n - r) + e log(n - r) 


(70) 


7‘(£) = 


n — r 


n — r 


We use the following consequence of ([^ and (25) several times, 
(71) 


n — r 
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3.1. Proof of the upper bound. In this section we prove the upper bound part of (66). 
By Lemma 3.1, it suffices to prove the following upper bound for max/ig[o,i] (h). 

Proposition 3.2. For all £ > 0, 


(72) 


lim P 

n^oo 


max Xrn (h) > rrin-ris) 
he[o,i] ’ 


= 0 . 


The hrst step is to reduce the proof to a bound on the maximum over the discrete set 


Firi n [0,1] (as dehned in (53)) using the continuity estimates from Section 2.2 
Lemma 3.3. For all £ > 0, 


(73) 


lim P 

n—^oo 


max Xrn (h) > ’R-n-r (2e), max Xm (h) < rrin-ri^) 
/7e[o,i] /ieR„n[o,i] 


= 0 . 


Proof. Using translation invariance and a union bound on 2"^ intervals, the probability in 
(73) is at most 


2”P 


max ^ Xr^n (^) — ^^n—r (2^) , Xr^n (0) Fi r(^) 
/t. I I 2 ^ 


Proposition 2.5 can be applied with k = n, x = rrin-ri^) = {n — r)^[e) and a = run-rif^s) — 
nin-ri^) = (n — r) < 2“^^ — x. This gives the upper bound 


(74) 


c2"' exp ( — {n — r] 


2a2 


— (log (n — 


Using (0 and we get that ( [74l ) is at most 

c2^ ( 2 ^-^ {n - r)g-cU/hiog(n-r))3/2 j = 0 ( 1 ). 


□ 


The second step is to show that for each h G [0,1] the process k —)• Xr^k (h) satishes a 
barrier condition with very high probability. This simply requires a union bound together 
with continuity estimates. 

Lemma 3.4. For all e > 0, 

(75) lim P \3h G [0,1], A; G { [lognj^,..., n| s.t. Xr^k (h) > {k — r)n{e) + (logn)^l = 0. 

Proof. By two successive union bounds, hrst over the scales k = [lognj^, ... ,n, and then, for 
each of those scales, over 2^ intervals (together with translation invariance), the probability 
in (75) is at most 


2^F 

Ai=[log nj2 


max Xrk (h) > (k — r)^ie) + (logn)^ 
h:|h|<2-'=-l 


The maximal inequality (45) can be applied since the right-hand side of the inequality in 
the probability is less than a constant times {k — r). Thus the sum is bounded above by. 


c ^ 2^ exp 

fe=[log nJ2 


((fc-r)/i(£) + (logn)^)' 
2{k — r) 
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Using (0 the argument in the exponential is at least 


{k — r) log 2 — - log (n — r) + c (log nf . 


We conclude that the probability in (75) is at most 


n 


k= [log n\ 2 


□ 


Lemma 3.3 and Lemma 3.4 show that max/igp^j W,n {h) exceeds mn-r (2^) only if, for 
some h G "Hn H [0,1], Xr^n (h) exceeds mn-r{£) and the process {Xr^k (h), [lognj^ < k < n) 
stays below a linear barrier. The number of h G Tin that manage this feat is 


(76) 


= E/xew„n[o,i] where 

J+ (h) = {Xr,n (h) > nin-rie), Xr,k (h) < {k - r)/i(e) + (logn)^ Vfc > [lognj^} 


We show P [Z+ > 0] < c2^ (logn)^ (n — thereby proving Proposition 3.2 since the 

right-hand side is o(l) by the definition ( [6^ of r. Here we shall use the previous Gaussian 
approximation results and the ballot theorem. 

Proposition 3.5. For all c > 0, 

(77) P [Z+ > 0] < E [Z+] < c2^ (logn)® {n - r)"^" . 

Proof. By translation invariance and linearity of expectation, we have E [Z+j = 2"'P [J+ (0)]. 
We show that 


(78) 


P [J+(0)] < c2’-" (logn)” (n - , 


thus yielding (77). To prove ( [78| , let A = /i(e)/cr^, and recall the dehnition of Qa from (31). 
We have that 


(79) 


P [^+(0)] < Qa [^^(0)] 


because Xr,n (0) > {n — r)ij,{e) on the event J~^ (0). Using the estimates (34) and (26) we 
get that 


(80) 


X) d'* W - ^(" - ’•)/!(£) = -(n - r)^^ + O (e . 


k=r+l 


By (0, the exponential in ( [7^ is thus at most c2^ '^(n — r )2 It remains to show 
(81) Qa [<7^(0)] < c (logn)® (n — . 

The event J’''(0) takes the form in Proposition 2.11 with m = r. Thus Qa [<7’'"(0)] is at most 

where 

k n—r 

Ui = I?/ G ~ - r, ^(y; - fi{e)) > o|. 


(=1 


1=1 
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After recentering, the probability of Ei is simply 
(82) ' 


|y e K” < (lognfvk > [lognj'^ - > o} 

1 = 1 1 = 1 


By conditioning on ^ Vi ~ bound the above by the supremum over 

q G [—(logn)^, (logn)^] of Pg ^ 2 ”*'*°^"'^ H-^ 2 ) + where 

k n- [log nj 2 

(83) ^2 = {?/G : ^ 2 /z < (logn)^ - g V/c > 0, ^ yi>-q]- 


1=1 


1=1 


This is because of the standard Gaussian bound 

(log n)^—r 

y e 

Z =1 


x((logn)2-r) 

V0,<t2 


-r , 

: ^ 1 /z < -(logn)^l < cexp (-c 


(logn)' 


(logn)" 


For a given g, the probability of the event in (83) may be bounded above by a union bound 
over a partition of [—g, (log n)^ — g] into intervals of length 1, and the ballot theorem (Lemma 


2.12). This gives an upper bound for (82) of 


n 11 ., (l + (logn)^-g) ( 2 (logn)^) 3/2 

sup (lognj X ^ < c(iog? 7 ,) (n — r) ' . 


— (log n)^<q< (log n)2 


This proves (81), and thus also (78) and (77). 


□ 


3.2. Proof of the lower bound. In this section, we prove the lower bound part of ( 66 ). 
The proof is reduced to a lower bound on max/igfo^] (h) by Lemma 3.1 We show: 

Proposition 3.6. For all £ > 0, 


(84) 


lim P 

n^OQ 


max Xrn(h) > mn-r (—£) 
/ie[ 0 ,i] ’ 


= 1 . 


As for the upper bound, we consider a modihed number of exceedances with a barrier. 
For 5 > 0, let 

J~{h) = {Xr,n{h) G [mn-r{-£),rnn-r{-£) + 5], Xr,k{h) <{k- r)yi{-e) + 1 VA) = r + 1 ,... ,n} 

We omit the dependence on the parameter 5 in the notation for simplicity. Consider the 
random variable, 

h&Hnr\[0,l) 

Clearly, max/jgjo^] Xr^n {h) > '^n-r (—£) if and only if Z~ > 1. The Paley-Zygmund inequal¬ 
ity implies that 

riz- >i)> 

We will prove the following estimates for the hrst and second moments of Z~. Let 


(85) 


n—r L—r 

A = I?/ G G [0,(5], < 1 V/ = r -Fl,... ,n|. 


k=l 


k=l 















MAXIMA OF A RANDOMIZED RIEMANN ZETA FUNCTION 


23 


Lemma 3.7. For <5 > 0, 

(86) E[Z-] > (1 + o (1)) 2"(n - • 


Lemma 3.8. For 5 > 0, 

(87) E[{Z-f] <(l + o(l))(2’'(r!-r)i+^*r,3-''[T)' ■ 


The lower bound (84) follows directly from these two lemmas. 


Proof of Proposition 3^. By the Paley-Zygmund inequality, Lemma 3T and Lemma 3.8 
have 

E[Z-]2 


we 


P 


max Xrnih) > rUn-r (— c) 
he[o,i] ’ 


> P(Z" > 1) > 


El(Z-f] 

The result follows by taking the limits n —)■ oo, then 5 —)■ 0. 


>(l + o(l))e 


-2c<5 


□ 


We now prove the bound on E[Z ]. 


Proof of Lemma 3.7. Translation invariance implies E[Z“] = 2” P[J“(0)]. Consider the 
probability Qa from (31), where A = p,{—e)/a‘^. (By (35) and (26), this choice of A implies 
that Qa [Tfc(O)] is approximately ju(—e).) Since on the event J~ (0) we have that Xr^n < 
{n — r)/i(—e) + 5, the definition of Qa implies that 

(88) P [J"(0)] > Qa [J~ (0)] . 

Proceeding as in ( [^ to estimate Yllt=r+i — ^{n — r)fi {—e), and using 0. we get 

P [^“(0)] > (1 + o(l)) - rf^^+^^Qx [^”(0)] . 

The event J~ (0) is of the form appearing in the Berry-Esseen approximation of Proposition 


2.11 The result can be applied with m = r, and after recentering the increments by their 
mean /i = Aa^ = we get, 

_ X {n—r) 


Qa [J-m=vCr^ [A]+0{e-^-) 


Note that (65) of the ballot theorem (Lemma 2.12) ensures that 

(89) [^4] > c{n - r)"^/^ 

Thus [^4] dominates since r = [(loglogn)^J. This proves the lemma. 


□ 


Remark 3. We note for future reference that the same reasoning (using that Xr^n ^ {n — 
r)n{—e) on J~{0), cf. ([^ ) gives the upper bound, 

(90) P [J"(0)] < (1 + o(l)) 2A^-^){n - rf/‘^+‘^^pAn-r) _ 

To prove the second moment bound in Lemma |3.8| we use the identity 

(91) n{Z-f]= Y. P [J-(/^i) n J-(ha)] . 

,/^^2£'Hnn[o,i) 


We thus seek bounds on P [J~ {hi) fl J~ (ha)] for hi ^ ha. This is the key additional difficulty 
in the lower bound calculation. In essence, these bounds are obtained by conditioning on 
the values of the processes k ha Xr^k{hi), close to the “branching point” hi A ha (defined in 
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(13)), and then applying the following two lemmas. Lemma 3.9 gives an estimate for the 
part of the event before the branching point (where the processes are conpled), and Lemma 


3.10 for the part after (where they are deconpled). To get snfficiently strong conpling and 


deconpling, each estimate must be applied for scales that are respectively slightly before and 
slightly after the branching point. To quantify this, we use for the decoupling parameter A 
the value 

(92) A = r/100 . 

For convenience, dehne the recentered process 

Xr,k (h) = Xr^k (h) -{k-r) ii{-e) . 

Lemma 3.9. Let hi, ^2 G M and I = hi A h 2 . For i = 1,2 and any q > 0, define the event 

(93) Afiq) = {Xr,i-A {hi) e [-q, -q + 1] , Xr,k {hi) < 1 for k = r + 1,... ,l - A] . 

Then for any qi,q 2 > 0, 


(94) 


P [Ai{qi) n ^ 2 (^ 2 )] < c 




{I — A — r 


jj^{l + qi)e^ 


( 41 + 92 ) 


from (31). The 


Proof Let A = fi{—e)/{2a‘^) and A = A(l, 1). We recall the dehnition of 
choice of A ensures that Qa [(Kfe(O)] is approximately fi{—e){l, 1). By the dehnition of 
(95) 

l-A 


IP [^l{(h) (A ^2(<?2)] — 


lAi(iji)nA2((j2) ^ 
i=l,2 


1 (/ij) 


exp “ 2A/i(-£)} , 


\k=r+l 


where lAi(gi)nA 2 (g 2 ) denotes the indicator function of the event. Using Proposition 2.4 as well 
as the covariance estimates (26) and (27), we have that 


l-A 


l-A 


= (^k + Pk + 0{e ')] < X\l-A-r)2a‘^+0{l) = {l-A-r) 


k=r-\-l 


k=r-\-l 


2a2 


+ 0 ( 1 ). 


■ ■ /I \ i-t{ — 

This proves that the second exponential in (95) is at most ce~'' ~ 2,72 . Also on the event 

P^i{hi) F 242 (^ 2)5 the hrst exponential is at most Thus 

(96) P[Ai(gi) n ^ 2 (^ 2 )] < [Afiqi) n ^ 2 (^ 2 )] • 

It remains to bound Qa [Ai(gi) fl A 2 {q 2 )]- In fact, we drop the condition on h 2 and bound 
Qa [Ai(gi)]. We expect not to lose much by this because the behaviour at hi and h 2 should 
be very similar. The event Afiqi) is of the right form to use Proposition 2.10 with m = r 
and n = I — A. After recentering of the increments by p(—e), we get that Qa [Ai(gi)] is 


{l + 0{e-n)h 


X (l—A—r) 

'0,cr2 


^ ^i-A-r . J2i'=i 2 //' < 1 for h = 1,...,/- A - r, 
Lz'=i yi' e -hi + ij 


+ 0{e-^^^) . 


By (64) of the ballot theorem (Lemma 2.12) with b = —qi and h = 1 the probability on the 
right-hand side is at most Together with (96) this proves (94). □ 
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We now prove the bound for scales after the decoupling point. One notable difference with 
the proof of the previous lemma is that the change of measure is now done for a A which is 
twice the one of Lemma [3.91 This reflects the fact that, before the branching point, the two 
processes are essentially coupled, therefore a tilt for one process is also a tilt for the other. 

Lemma 3.10. Let hi, h 2 G M. For any hi A ^2 + A < j < n, and 6,6' > 0, define for i = 1,2 

and q > 0 the events 

(97) _ _ 

Bi{q) ^ {X,,n{hi) - qe [-<5', 5] (hi) - q< 1 for k = j 1,... , 11 } 

'B—j j' 

BM) = [y e -ge [-6',6],'^{yk)i-q < l,Vj' = 1,... ,n - j 


k=l 


k=l 


where y= (( (2/fc)i , (2/^)2 ), ^ = Then for qi,q 2 G 

(98) 


P[i?i(gi)n 52 (^ 2 )] <(l + o(l))e^'' n 

i=l,2 

_ M(-£) \ _ 


I J) ^Bi{qi)\ +e ) 


Proof. Let A = , A = A (1,1) and recall the dehnition of Qa from (31). The choice of A 

ensures that Qa is approximately y{—e) (1,1). The dehnition of Qa gives 


(99) P [5i(gi) n ^ 2 (^ 2 )] = 


^Bi{qi)nB2iq2) ^ 
i=l,2 


j,n i^i) 


hf’(A)-2A(n-jX-£) _ 


By Proposition 2^ (26) and (27), 

(A) = X^ial + Pk) + 0(e-2'“-^) = Xp{-e) + O _ 

We deduce that X)fc=j+i '4’^k\x^) most {n — j) Xp{—e) + c2~^. Therefore, the second ex- 
ponential in ( [^ is (l-|-o(l))e~^^"'~'^^ 2 <t^ . On the event i?i(gi)ni? 2 (q' 2 ), the hrst exponential 

■ ■ -.f /j,( —e) y,{ — e) 

in (99) is at most e“ e~ In view of this, it only remains to show 


( 100 ) 


|B,((/i)n52(92)] < (1 + 0(1)) n 'hX ” Aita)] 

i=l,2 


„cA 


Note that the event Bi{qi) fl B 2 {q 2 ) takes the form considered in Proposition 2.9 Applying 
Proposition 2^ with j in place of m and then recentering yields 

[i?i(gi) n 52(^2)] < (1 + ce-^^) [Biiqi) n B2{q2)] + . 


By ind epend ence, it is plain that [^ 1 (^ 1 ) n 52 (^ 2 )] = ni=i, 2 ho,a 2 [Bi{qi)]- This 

proves (100) and therefore also (98). □ 

The previous lemmas will now be used to prove bounds on P [J~ (hi) fl J~ (^2)] in three 
cases: i) hi A ^2 < r — A, ii) r — A < hi A ^2 < r -|- A, and hi) r -|- A < hi A h 2 < n — A. 


The case hi A h 2 > n, — A is easy and will be handled directly in the proof of Lemma 3.8 


If hi A h 2 < r — A then hi and h 2 are sufficiently far apart so that the scale r is well 
beyond the “branching point” of hi and h 2 , and the events J~ (hi) and J~ { 112 ) decouple: 
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Lemma 3.11. Let /ii, /i 2 G K be such that 1< hi A h 2 <r — A. Then 


( 101 ) 


P[J- (Ai)n J- (ftj)] < (1 + 0 ( 1 )) 


(n — r) 


+ 2£ 


[-4] I . 


where A is the event defined in (85). 


Proof. Let j = r. By assumption we have hi A h 2 + A < j, so Lemma 3.10 can be applied 
with qi = q 2 = 0 and = 0 to give, 

P [J- (fci) n J- (h,)] < (1 + 0(1)) |4] + . 

By (89) and (92) the probability [A] dominates , so the claim follows by (71). □ 

In the case where hi and /i 2 are such that their “branching point” happens after the scale 
r + A, there is no hope of a decoupling of J~ {hi) and J~ {h 2 )- Instead, we need to split 


the probability into a coupled part and a decoupled part and use Lemmas |3.9| and |3.10 
separately. 

Lemma 3.12. Let hi, ^2 G M and I = hi A h 2 . Ifr + A<l<n — A, then 


(102) P [J- (hi) n J- (h 2 )] < c 2^9^+” 


(n-/-A)^(/-A-r)3/2 ' 

Proof. Write {h) = (h) + X;_A,n (h) and decompose the event J~ (hi) fl J~ {I 12 ) 

over the values of (h) as follows 

00 / 2 

i J {hi) n J (h2) n |Xr,i-A {hi) G [—qi, —qi + 1]} 


gi:<?2=o 


i=l 


For hxed qi, q 2 , the event in the union is contained in ni=i 2 (?i) FlC* {qi) , where the events 

Ai{q) are dehned in (93) and for i = 1, 2, 

Ci {q) = {Xi_A,n {hi) G [q-l,q + 5], Xz_A,fc (h^) < g + 1 for h = / - A,..., n} . 

Now note that (Xr,fc (hi))^<^<;_^ , i = 1,2, are independent from (Xz_A,fc (hi))^_^^^^^ , 

1,2. Altogether we get thaf 

00 

(103) P [J- (hi) n J- (h 2 )] < P [7li(gi) n ^ 2 (^ 2 )] P [C'i(gi) n ^ 2 (^ 2 )] • 

91 > 92=0 


I = 


Lemma 3.9 


gives 


P [24i(gi) n ^ 2 (^ 2 )] A c 


3 ^ ' 20-2 


{I — A — r) 


3/2 


(l + gi)e1^(''i+'?^\ 


In order to use Lemma 3.10 , we express the probability on the event Cfs by conditioning on 
X)_A,;+A(^i)) which are independent of Xi+A,n(^i)- We have 


(104) P[Ci(gi) nC2(g2)] = / P[ 5 i(gi - 9 / 1 ) nSi(g2-1/2)] f{yi,y2)dyidy2 
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where f{yi,y 2 ) is the density of (X/_A,i+A(hi),7 = 1,2) , and the events BiS are as in (97) 
with 5' = 1. Lemma 3.10 then gives 


(105) P[5i(gi - j/i) n ^ 2(^2 - 1 / 2 )] < c 


g 2(n I A) TT /I I N -f^^(qr-yi) 


nsing also that [Bi{qi — yi)~\ < c(l + Qi — yi)/ (n — I — by (64) of the ballot 

theorem with ^' + h in place of 6 and b = — y^ — 5'. Thus 


P [J- (hi) n J- (h 2 )] < / n (1 + y 2 )dyidy 2 . 


i=l,2 


To handle the integral, note that Proposition 2.4 implies 

l+A 


(106) 


< cexp 




^fc=i-A+l 


where the last inequality follows from (26) and the inequalities Pfc < cr^ < 2(T^ and jcB < 
2 (see (71)). Using ( [^ , the same estimate holds for E[X;_A,z+A(hi)e^^<^2'(Ei=i, 2 ^!-A,!+A(/*i))j 
and E[ni=i 2 "^^-A,i+A(hi)e^^^'^'"^’'+'^(^‘)]. Altogether this implies 


(107) 


[ n (1+O'* 

*=12 


f{yi,y 2 )dyidy 2 < c(l + gi)(l + g 2 )e^ . 


Thus, equations (103) to (107) yield 


P [J- (hi) n J- (h 2 )] < c2 


16 A 


^_(2(n-/-A)+a-A-r))UAl- 


(/ - A - {n-l- Af 


where we used the fact that q2=o diY {^ + Q 2 ) e is finite. The claim then 

follows from ( [7l| ). □ 

The case where the branching point is between r — A and r + A is handled similarly. 

Lemma 3.13. Let hi, h 2 G M be such that r — A<hiAh 2 <r + A. Then 

(108) P [J- (hi) n J- (h2)] < - r)^" , 

where / := hi A h 2 . 

Proof. Since r —A < I < r + A, we have the decomposition Xr,n{h) = Xr,i+A{h) + Xi+A,n{h). 

by conditioning on Xj.^i^^{hi), i = 1,2, and then drop the 
for both 7 = 1 and 7 = 2. Following (104) and (105), this 

-2(n-Z-A)4^ 


We proceed as in Lemma 3.12 
barrier condition on Xr^i^/^{hi 
gives 


P [J- (hi) n J- (hs)] < c- 


{n-i- xy 


[ n 

i=l 2 


e‘^y^f{yi,y2)dyidy2 , 
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where f{yi,y 2 ) is now the density of {Xr,i+A{hi),i = 1,2). The integral can be estimated 
nsing Proposition 2.4 as in (106). It is smaller than 02^®"^. By (71), the fraction in front of 
the integral is 

Since r — A</<r + A, this is smaller than as claimed. □ 

We now have the necessary two-point estimates to prove the npper bonnd on E[(Z“)^]. 


Proof of Lemma 5A We split the snm in (91) into fonr terms depending on the branching 
point hi A ^2 of the pair hi, ^2 G Pin H [0,1): 

hiAh2<r—A hi,h2'. r—A<hiAh2<r+A hi,h2'. r-\-A<hiAh2<n—A 


E ( 

h\/\h2'>n—A 


“V^ 

(p 


(II) 




~(m 


Using that ffPLu H [0,1) = 2” and the bonnd (101), we get 

(/) < (1 + 0 ( 1 )) {2'(n - r)i+%>=yr’'> Dl)' . 

By (89) the right-hand side is at least c2^^(n — We now show that (//), (7//) and {IV) 
are negligible compared to this, and thns (I) is the dominant term in the snm. Note that 
the nnmber of pairs hi, 7.2 G Hn H [0,1) snch that 2~^~^ < |hi — h 2 | < 2~^ is at most c2^"'“h 
Thns the contribntion of {II), by Lemma 3.13, is at most 

r+A 

{II) <c < c2^^^2^{n - , 

l=r-A+l 

which is negligible compared to 2‘^^{n — r)^^, becanse of the choice A = r/100. Similarly, the 
contribntion of {III) can be bonnded as 

n—A—1 


(///)< Y1 2 


2n—l 


/=r+A+l 


max P [j-(0) n J" (h)l . 

h£[2-l-\2‘] ^ ^ 


Lemma [3. 12| then yields 

{III) < c2^+^^^ 


n 


\4e 


n—A—1 

l=r+A-\-l 
m—2A—1 


{n — r) 


3 /o Z+3A-r ^ 

2 V n — r / 


{n — I — A)3(/ — A — r)^P 


f(2-(a+2A)/m) 

= {n — r)‘^^ ^2 TZ - ^ ^ N,q_ci /9 ) ioT m = n — r 


a=l 


{m — a — 2A)^a^P 


< c2 


r+19A 


(n — r) 


4e 


where the last ineqnality follows from the fact that the snm over a stays finite as m —)■ cxd. 
Since A = r/100, the bonnd on {III) is negligible relative to the bonnd on (/). Finally, for 
{IV), the event J~ (h 2 ) can be dropped. There are at most pairs hi, h 2 G Pin H [0,1) 
snch that |hi — h 2 | < A nnion bonnd nsing the one-point bonnd (90) gives 

{IV) < 2"+^P[J-(0)] < (1 + o(l)) 2'’+^(n - r)2" . 
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Again, this is negligible relative to the bonnd on (J). Therefore 

(/) + (II) + {III) + (IV) < (1 + 0(1)) (2’-(n - r)i+2'<ir'> [A])' . 
which proves the lemma. □ 


This bound on the second moment of Z concludes the proof of lower bound Proposition 


3.6, and therefore also for the main result Theorem 1.2 
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